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@ Expression of protein C. 

© Genomic and cDNA sequences coding for a protein having 
substantially the same biological activity as human protein C 
and recombinant expression vectors comprising these sequen- 
ces are disclosed. Methods are disclosed for producing a 
protein which has substantially the sarne biological activity as 
human protein C, The protein, which may be in the form of 
activated protein C, is produced by mammalian host cells 
transfected with a plasmid capable of Integration in mammalian 
host cell DNA. The plasmid includes a promoter followed 
downstream by a DNA sequence which encodes a protein 
having substantially the same structure and/or activity as 
human protein C. the nucleotide sequence being foHowed 
downstream by a polyadenylation signal. 
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Description 

EXPRESSION OF PROTEIN C 



Technical Field 

5 The present invention relates generally to plasma proteins and DNA sequences encoding them, and more 
specifically, to the expression of. proteins having substantially the same structure and/or activity as human 
protein C or human activated protein C. 

Background Art 

10 Protein C is a zymogen, or precursor, of a serine protease which plays an important role in the regulation of 
blood coagulation and generation of fibrinolytic activity in vivo . It is synthesized in the liver as a single^chain 
polypeptide which undergoes considerable processing to give rise to a two-chain molecule comprising heavy 
(Mr -= 40,000) and light (Mr — 21,000) chains held together by a disulfide bond. The circulating two-chain 
intermediate is converted to the biologically active form of the molecule, known as "activated protein C" (APC) , 

15 by the thrombin-mediated cleavage of a i2-residue peptide from the aminoterminus of the heavy chain. The 
cleavage reaction Is augmented in vivo by thrombomodulin, an endothelial cell cof actor (Esmon and Owen, 
Proc. Natl. Acad. Scl. USA 78: 2249-2252. I98l). 

Protein C is a vitamin K-dependent glycoprotein which contains approximately nine residues of 
gamma-carboxyglutamic acid (Gla) and one equivalent of beta-hydroxyaspartic acid which are formed by 

20 post-translational modifications of glutamic acid and aspartic acid residues, respectively. The post-transla- 
tional formation of specific gamma-carboxyglutamic acid residues in protein C requires vitamin K. These 
unusual amino acid residues bind to calcium ions and are believed to be responsible for the interaction of the 
protein with phospholipid, which is required for the biological activity of protein C. 

In contrast to the coagulation-promoting action of other vitamin K-dependent plasma proteins, such as 

25 factor VII, factor IX, and factor X, activated protein C acts as a regulator of the coagulation process through the 
inactivation of factor Va and factor Villa by limited proteolysis. The inactivation of factors Va and Villa by protein 
C is dependent upon the presence of acidic phospholipids and calcium ions. Protein S has been reported to 
regulate this activity by accelerating the APC-catalyzed proteolysis of factor Va (Walker, J. Biol. Chem. 255 : 
552I-5524, I980). 

30 Protein C has also been implicated in the action of tissue-type plasminogen activator (Kisiel and Fujikawa, 
Behring Inst. Mitt. 73 : 29-42, 1983). Infusion of bovine APC into dogs results in increased plasminogen activator 
activity (Comp and Esmon, J. Clin. Invest. 68: 122I-I228, 198I). Recent studies (Sakata et al., Proc. Natl. Acad. 
Sci. USA 82: II2I-II25, I985) have shown that addition of APC to cultured endothelial cells leads to a rapid, 
dose-dependent increase in fibrinolytic activity in the conditioned media, reflecting increases in the activity of 

35 both urokinase-related and tissue-type plasminogen activators by the cells. APC treatment also results in a 
dose-dependent decrease in antiactivator activity. 

Protein C deficiency is associated with recurrent thrombotic disease (Broekmans et al.. New Eng. J. Med. 
309 : 340-344, I983; and Seligsohn et al., New Eng. J. Med. 310 : 559-562, I984) and may result from genetic 
disorder or from trauma, such as liver disease or surgery. This condition is generally treated with oral 

40 anticoagulants. Beneficial effects have also been obtained through the infusion of protein C-containing normal 
plasma (see Gardiner and Griffin in Prog, in Hematology , ed. Brown, Grune & Stratton, NY, ]3: 265-278). In 
addition, some investigators have discovered that the anti-coagulant activity of protein C is useful in treating 
thrombotic disorders, such as venous thrombosis (Smith et al., PCT Publication No. WO 85/0052!) . In some 
parts of the world, it is estimated that approximately I in 16,000 individuals exhibit protein C deficiency. Further, 

45 a total deficiency in protein C is fatal in newborns. 

Finally, exogenous protein C has been shown to prevent the coagulopathy and lethal effects of gram 
negative septicemia (Taylor et al., J. Clin. Invest. 79 : 918-925, 1987). Data obtained from studies with baboons 
suggest that protein C plays a natural role in protecting against septicemia. 

While natural protein C may be purified from clotting factor concentrates (Marlar et al., Blood 59 : 1067-1072) 

50 or from plasma (Kisiel, ibid.), it is a complex and expensive process, in part due to the limited availability of the 
starting material and the low concentration of protein C in plasma Furthermore, the therapeutic use of 
products derived from human blood carries the risk of disease transmission by, for example, hepatitis virus, 
cytomegalovirus, or the causative agent of acquired immune deficiency syndrome (AIDS). In view of protein 
C's clini cal applicability in the treatment of thrombotic disorders, the production of useful quantities of protein 

55 C and activated protein C is clearly invaluable. 

Disclosure of the Invention 

Briefly stated, the present invention discloses DNA sequences which code for proteins having substantially 
the same structure and/or biological activity as human protein C or human activated protein C. In one aspect of 
60 the present invention, the DNA sequence further codes for the amino acid sequence (Ri) n -R2-R3-R4, wherein 
Ri. R2. R3 and R4 are Lys or Arg and n = 0, l, 2, or 3, at the cleavage site of the light and heavy chains. In a 
preferred embodiment, the amino acid sequence at the cleavage site is Arg-Arg-Lys-Arg. in another aspect of 
the present inven tion, the protein includes the substitution of residue 158 (Asp) with a non-acidic amino acid 
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residue such as Ala, Ser, Thr or Gly. In a related aspect, the protein includes the substitution of residue I54 

ImkLw 9 JT I"™"? aC,d Xesm& SUCh 38 or Ar 9- ln anotner ^P^t. the protein includes the 
substitution of the Lys-Arg at residues 156-157 of native protein C with Lys-Lys or Arg-Arg 

Yet another aspect of the present invention is directed toward a DNA sequence which codes for a protein 
having , subtantially -the same biological activity as human protein C orhuman activated protein C. the sequence 
further encoding the pre-pro pept.de of a protein such as factor VII, factor IX, factor X, prothrombin or protein 

J^fin";, *? PreSent in r nt J°. n discl °? es expression vectors capable of integration in mammalian host 
snLw,; 2 *f c 9 8 Pr £ m ? er f °"? Wed downstream by a DNA sequence which encodes a protein having 
substantially the same structure and/or activity as human protein C or human activated protein C as set forth 
Miow^^^c '° n k nuc ' eo !i de sequence being directed by the promoter. The nucleotide sequence is 
followed downstream by a polyadenylation signal. In one embodiment, the expression vector includes a 
thi fl^h^i 00 ^ b T aa P^H nucleotide sequence and the. polyadenylation signal, transcription of 
spHce sitls y promoter - expression vector may also include a set of RNA 

A related aspect of the present invention discloses mammalian cells transfected to express a protein which 
upon activation has substantially the same biological activity as human activated protein C The mammatan 
cells are trans acted with an expression vector capable of integration in mammalian host cell^DN A the 
expression vector including a promoter followed downstream by a DNA sequence as described above Within 
M^li 3 seec f bk \ mar * er is also Introduced into the cells and stably transfected cells are 
selected. Mammalian cells transfected to express a protein which has substantially the same biological activity 

COS" BHK ^3 ceSs!" ° " Preferred hoSt cells for use the P^ent invention ^ 

«„h^n!?l, a t h PeCt ° f *; e , inve nti°n discloses a method for producing a protein which, upon activation, has 
fntn 1 ™=. sa n-e b.olo g! cal activity as human activated protein C. The method comprises (a) introducing 
In!??? h ?\ ex ?T S, ° n V80t0r com P ris ing a DNA sequence as described above, which 

encodes a protein having substantially the same structure and/or activity as human protein C; (b) growing said 
mammalian host cell in an appropriate medium; and (c) isolating the protein product encoded by said DNA 
sequence and produced by said mammalian host cell. The protein product produced according to this method 
is also disclosed. A rnethod for producing a protein which has substantially the same structure and/or 
biological activity as human activated protein C is also disclosed. 

The proteins described within the present invention may be used as active therapeutic substances 
including use in the regulation of blood coagulation. Further, these proteins may be combined with a 
physiologically acceptable carrier and/or diluent to provide suitable pharmaceutical compositions 
di^i 1 ng r s aSPeCtS ° f inVenti ° n Wi " becom e evident upon reference to the detailed description and attached 35 

Brief Description of the Drawings 

an open^'ox* reSt " Cti0n map ° f tne protein C cDNA in PHCA£I_ The coding region is indicated by 

Figure 2 illustrates the nucleotide sequence of the complete protein C cDNA and the deduced amino 
acid sequence of protein C. Arrows indicate cleavage sites for removal of the connecting dipeptide and 
activation peptide. s K H u 

Figure 3 illustrates a restriction enzyme map of the genomic DNA coding for human protein C. Numbers 
below the line indicate length in kilobases (kb). 

Figure 4 illustrates the complete genomic sequence, including exons and introns, of the human protein 
t a°?t / r ° W J heada in A di ° ate intron-exon splice junctions. The polyadenylation or processing sequences 
of a-T-T-A-A-A and A-A-T-A-A-A at the 3' end are boxed. ♦. potential carbohydrate attachment sites- y 
apparent cleavage sites for processing of the connecting dipeptide; i , site of cleavage in the heevy chain 
when protein C is converted to activated protein C; • . sites of polyadenylation. 

Figure 5 illustrates a schematic two-dimensional model for the structure of human protein C 

Figure 6 illustrates the subcloning of the 5' and 3' portions of a protein C partial cDNA clone ' 
^ Figure 7 illustrates the removal of intron A from the genomic clone, resulting in the fusion of exons I and 

Figure 8 illustrates the fusion of exons I and II to the 5'-most portion of the cDNA insert of figure I 
Figure 9 illustrates the construction of a plasmid comprising the complete coding sequence for protein 

Figure 10 illustrates the expression vector pD7C. Symbols used are ori. the adenovirus 5 0-1 map unit 
sequence; E the SV40 enhancer; Ad2MLP, the adenovirus 2 major late promoter; U-3. the adenovirus 2 

EKELStS? f f ? ' SS ' 3 ' SP ' iCe SKe: pAl the SV40 ear * Pelyadenylation signal; and A, 60 

the deleted region ofthepBR322 poison" sequences. 

Figure II illustrates the expression vector pD5(PC-DHFRr). DHFRr denotes the methotrexate resistant 
mutant d.hydrofolate reductase gene sequence; pA denotes the SV40 late polyadenylation signal. Other 
symbols used are as described for Figure 10. 

Figure 12 illustrates the expression vector pDX/PC. Symbols used are as described for Figure II. 55 
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Figure 13 illustrates the results of an assay for activated protein C on media samples from transfected 
293 ceils. 

Figure 14 illustrates the expression vectors pDX/PC962 and PC962/229. 

Figure 15 illustrates the anticoagulant activity of protein C prepared according to certain embodiments 
5 of the present invention. 

Best Mode for Carrying Out the Invention 

Prior to setting forth the invention, it may be helpful to an understanding thereof to set forth definitions of 
certain terms to be used hereinafter. 

10 

Biological Activity : 

A function or set of functions performed by a molecule in a biological context (i.e.. in an organism or an 1 in 
vitro facsimile). Biological activities of proteins may be divided into catalytic and effector activities. Catalytic 
activities of the vitamin K-dependent plasma proteins generally involve the specific proteolytic cleavage of 
15 other plasma proteins, resulting in activation or deactivation of the substrate. Effector activities include 
specific binding of the biologically active molecule to calcium, phospholipids or other small molecules, to 
macromolecules, such as proteins, or to cells. Effector activity frequently augments, or is essential to, catalytic 
activity under physiological conditions. 

For protein C, biological activity is characterized by its anticoagulant and fibrinolytic properties. Protein C, 
20 when activated, inactivates factor Va and factor Villa in the presence of phospholipid and calcium. Protein S 
appears to be involved in the regulation of this function (Walker, ibid.). Activated protein C also enhances 
fibrinolysis, an effect believed to be mediated by the lowering of levels of plasminogen activator inhibitors (van 
Hinsbergh et al., Blood 65: 444-451, 1985). As more fully described below, that portion of protein C encoded by 
exons VII and VIII of the protein C gene is primarily responsible for its catalytic activities. 

25 

Pre-Pro Peptide : 

An amino acid sequence which occurs at the amino terminus of some proteins and is generally cleaved from 
the protein during translocation. Pre-pro peptides comprise sequences directing the protein into the secretion 
pathway of the cell (signal sequences) and are characterized by the presence of a core of hydrophobic amino 
30 acids. They may also comprise processing signals. As used herein, the term "pre-pro peptide" may also mean 
a portion of a naturally occuring pre-pro peptide. 

Expression Unit : 

A DNA construct comprising a primary nucleotide sequence encoding a protein of interest, together with 
35 other nucleotide sequences which direct and control the transcription of the primary nucleotide sequence. An 
expression unit consists of at least the primary nucleotide sequence and a promoter sequence located 
upstream from and operably linked to the primary nucleotide sequence and a polyadenylation signal located 
downstream. Additional genetic elements may also be included to enhance efficiency of expression. These 
elements include enhancer sequences, leaders, and mRNA splice sites. 

40 

Expression Vector : 

A DNA molecule which contains, inter alia, a DNA sequence encoding a protein of interest together with a 
promoter and other sequences which facilitate expression of the protein. Expression vectors further contain 
genetic information which provides for their replication in a host cell, either by autonomous replication or by 
45 integration into the host genome. Examples of expression vectors commonly used for recombinant DNA are 
plasmids and certain viruses, although they may contain elements of both. They also may include a selectable 
marker. . 

As noted above, protein C is produced in the liver and requires vitamin K for its biosynthesis. Vitamin K is 
necessary for the formation of specific gamma-carboxyglutamic acid residues in the aminc-terminal region of 
50 the light chain. These amino acid residues are formed by a post-translational modification, and are required for, 
calcium-mediated binding to phospholipid. In addition, protein C contains one beta-hydroxyaspartic acid 
residue which is also formed in a post-translational modification. However, the role of this amino acid residue is 
not known. 

Given the fact that the activity of protein C is dependent upon post-translational modifications involving the 
55 gamma carboxylation of specific glutamic acid residues and cleavage to the two-chain form, and may also be 
dependent upon the hydroxylation of a specific aspartic acid residue, it is unlikely that an active product could 
be produced through the cloning and expression of protein C in a microorganism. 

Accordingly, the present invention provides a method of producing a protein which is gamma-carboxylated 
and, upon activation, has the biological activity of human activated protein C through the use of mammalian 
60 cells transfected to stably express the protein. 

The present invention further provides a method for producing a protein which is gamma-carboxylated and 
has the biological activity of human activated protein C without the necessity for activation. 

The light and heavy chains of bovine protein C have been sequenced (Fernlund and Stenflo. J. Biol. Chem. 
257: 12I70-I2I79, 1982; and Stenflo and Fernlund. J. Biol Chem . 257: I2I80-I2I90, 1982). Isolation and 
65 characterization of human protein C have been described by Kisiel, J. Clin. Invest . 64: 761-769, 1979. The 
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anticoagulant activities of both the human and bovine enzymes were found to be highly species specific 
Species specificity is believed to be mediated by protein S (Walker, Thromb. Res. 22: 32I-327 I98l) However' 
the human and bovine proteins show considerable overall structural homology to~each other and to other 
vitamin K-dependent plasma proteins, including prothrombin, factor VII, factor IX, and factor X. Similarities 
include the presence of the Gla residues in the light chain and the active site serine in the heavy chain as well s 
as other amino acid sequence homology, in the amino-terminal region of the light chain 

Within the present invention, a Xgtll cDNA fibrary was prepared from human liver mRNA. This library was 
then screened with ^l-labeled antibody to human protein C. Antibody-reactive clones were further analyzed 
for the synthesis of a fusion protein of p-galactosidase and protein C In the Xgtll vector 

™!Li™ ! h f 9 ^f A a Str0n9 Slgnal With the antibod y P^be and was found to contain an insert of w 

approximately WOO bp. DNA sequence analysis of the DNA insert revealed a predicted amino acid sequence 
which shows a high degree of homology to major portions of bovine protein C, as determined by Fernlund and 
Stenflo ( J. Biol. Chem. 257 : I2I70-I2I79; J. Biol. Chem. 257 : I2I80-I2I90) . 

The DNA insert contained the majority of the coding region for protein C beginning with amino acid 64 of the 
light chain including the entire heavy chain coding region, and proceeding to the termination codon Further 15 
ol owing the stop codon of the heavy chain, there were 294 base pairs of 3' noncoding sequence and a poly' 
(A) tail of 9 base pairs. The processing or poryadenylation signal A-A-T-A-A-A was present 13 base pairs 
upstream from the poly (A) tail in this cDNA insert. This sequence was one 30 of two potential polyadenylation 

The cDNA sequence also encodes the dipeptide Lys-Arg at position I56-I57 (numbering of amino acids is 20 
shown in Figure 2), which separates the light chain from the heavy chain and is removed during processlnq bv 
proteolytic cleavage resulting in secretion of the two-chain molecule. Upon activation of the two-chain 
molecule by thrombin, the heavy chain of human protein C is cleaved between arginine-169 and leucine-I70 
releasrng the activation peptide (Figure 2). 

By a similar method, a second cDNA which lacked the coding sequence for the pre-pro peptide and the first 25 
23 amino acids of protein C was isolated. Using this cDNA as a hybridization probe, the remainder of the 
coding sequence was obtained from a human genomic DNA library in X Charon 4A (Foster et a!.. Proc Natl 
Acad. Sci. USA 82: 4673-4677, I985). Three different X Charon 4A phage were isolated tha contained 
overlapping inserts for the protein C gene. 

The positions of exons on the three phage clones were determined by Southern blot hybridization of diqests 30 
of these clones with probes made from the 1400 bp cDNA described above. The genomic DNA inserts in these 
cones were mapped by single and double restriction enzyme digestion followed by agarose qel 
electrophoresis, Southern blotting, and hybridization to radiolabeled 5' and 3' probes derived from the cDNA 
for human protein C. as shown in Figure 3. 

DNA sequencing studies were performed using the dideoxy chain-termination method. As shown in Figure 35 
4, the nucleotide sequence for the gene for human protein C spans approximately 11 kb of DNA. These studies 
further revealed a potential pre-pro peptide of 42 amino acids. The pre-pro sequence is cleaved by a signal 
peptidase following the Gly residue at position -25. Processing to the mature protein involves additional 
proteolytic cleavage following residue -I to remove the amino-terminal propeptide, and at residues 155 and 157 
to remove the Lys-Arg dipeptide which connects the light and heavy chains. This final processing yields a light 40 
chain of 155 ammo acids and a heavy chain of 262 amino acids. 

The protein C gene is composed of eight exons ranging in size from 25 to 885 nucleotides, and seven introns 
ranging ,n size from 92 to 2668 nucleotides. Exon I and a portion of exon II code for the 42 amino acid pre-pro 
peptide The remaining portion of exon II, exon 111, exon IV, exon V, and a portion of exon VI code for the lioht 
chain of protein C. The remaining portion of exon VI, exon VII. and exon VIII code for the heavy chain of protein 
C. The amino acid and DNA sequences for a cDNA coding for human protein C are shown in Figure 2 

u 6 'r, tr0nS ' n thS 9Bne f ° r protein C are ,ocated P rlmari| y between various functional domains. Exon II spans 
the highly conserved region of the pre-pro peptide and the gamma-carboxyglutamlc acid (Gia) domain. Exon 111 
includes a stretch of eight amino acids which connect the Gla and growth factor domains. Exons IV and V each 
represent a potential growth factor domain, while exon VI covers a connecting region which includes the 
activation peptide. Exons VII and VIII cover the catalytic domain typical of all serine proteases 

The amino acid sequence and proposed structure for human pre-pro protein C are shown in Figure 5 
Protein C is shown without the Lys-Arg dipeptide, which connects the light and heavy chains. The location of 
sitera^cTrcied 8 thr ° U9h G> iS indlcated b V s6lid bars - A ™no acids flanking known proteolytic cleavage 

designates potential carbohydrate binding sites. The first amino acid in the light chain, activation peptide ^ 
and heavy chain start with number I. This numbering differs from that shown in Figures 2 and 4 

Carbohydrate attachment sites are located at residue 97 in the light chain and residues 79, 144, and 160 in the 
heavy chain, according to the numbering scheme of Figure 5. The carbohydrate moiety is covalently linked to 

I' V?l ma J° rrtv of ^stances, the carbohydrate attachment environment can be represented by Asn-X-Ser so 
or Asn-X-Thr, where X = any amino acid. 

As noted above j protein C plays a regulatory role in the coagulation process. The catalytic domain, encoded 
by exons VII and VIII, possesses serine protease activity which specifically cleaves certain plasma proteins 
(i.e. factors Va and Villa), resulting in their deactivation. As a result of this selective proteolysis, protein C 
displays anticoagulant and fibrinolytic activities. _ 
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Due to the presence of intervening sequences in the genomic clone, merely joining the genomic and cDNA 
sequences to provide a complete coding sequence is not sufficient for constructing an acceptable expression 
unit. It is therefore necessary to delete these intervening sequences for reasons more fully described below if 
a genomic clone is used to construct the expression unit. 

5 The 5' coding region may also be obtained by alternative methods and consequently eliminate the need to 
delete intervening sequences. The 5' coding region may be obtained by using probes derived from the existing 
cDNA or genomic clones to probe additional libraries. By this method, a full-length cDNA was isolated. 
Furthermore, the amino-terminal portions of the vitamin K-dependent plasma proteins are responsible for their 
respective calcium binding activities. It has been found that, as a result of this functional homology, the 

10 calcium binding domains of these molecules may be interchanged and still retain the activity specific to the 
catalytic domain of the resultant molecule. For example, as. described in U.S. Patent Application Serial 
No. 724.3H, filed April 17, 1985, and published as European Patent Office publication 200,421, the amino-terminal 
portion (calcium binding domain) of factor IX may be joined to factor VII at amino acid 38 to produce a protein 
having the activity of factor VII. Factor VII, factor IX, factor X, prothrombin, and protein S share this 

15 amino-terminal sequence homology with protein C. Consequently, a cloned sequence comprising the 
5'-coding region of the gene for any of these proteins might be substituted for the corresponding sequence of 
the protein C gene. Additionally, suitable coding sequences may be synthesized based on the known amino 
acid sequences of several of the vitamin K-dependent plasma proteins or on the sequence of protein C 
disclosed herein. Techniques for producing synthetic nucleotide sequences are well known in the art. For 

20 example, a set of overlapping oligonucleotides may be synthesized and annealed in pairs to yield 
double-stranded fragments with overlapping cohesive termini. These fragments are then ligated as any 
restriction fragments would be. The resultant synthetic fragment is then ligated to the cDNA at a convenient 
restriction site. The junction sequence may be modified as necessary by oligonucleotide-directed 
mutagenesis. 

25 When clones representing the entire coding sequence have been obtained, the appropriate regions may be 
joined, as necessary, to generate the desired coding sequence. Fragments obtained from one or more 
libraries are cut with appropriate restriction endonucleases and joined together enzymatically in the proper 
orientation. Depending on the fragments and the particular restriction endonucleases chosen, it may be 
necessary to remove unwanted DNA sequences through a "loop out" process of deletion mutagenesis or 

30 through a combination of restriction endonuclease cleavage and mutagenesis. The sequence so obtained 
should preferably be in the form of a continuous open reading frame, that is, that it lack the intervening 
sequences (introns) generally found on higher eukaryotic genes. The presence of introns in cloned genes may 
lead to aberrant splicing of messenger RNA and/or reduced efficiency of gene expression or instability upon 
amplification when the gene sequence is introduced into a mammalian host cell, it is preferred that this coding 

35 sequence further encode a pre-pro peptide in order to facilitate proper processing and secretion of the protein 
C produced according to the present invention. The pre-pro peptide may be that of protein C or another 
secreted protein, such as factor IX, factor VII, or prothrombin. 

Under some circumstances, it may be desirable to produce activated protein C directly, thereby removing 
the need to activate the protein product either in vitro or in vivo . The cleavage sites involved in the maturation 

40 and activation of protein C are known (Foster and Davie, ibid.). A sequence encoding APC may be constructed 
by deleting the region encoding the activation peptide through oligonucleotide-directed deletion mutagenesis. 
The resultant protein will then become activated by removal of the Lys-Arg dipeptide during normal proteolytic 
processing in the secretion pathway of the host cell. It has been found that proteins lacking the activation 
peptide are nevertheless properly processed by the host cells, resulting in secretion of activated protein C. 

45 In order to enhance the proteolytic processing involved in the maturation of the recombinant protein C to 
the two-chain form, it may be desirable to modify the amino acid sequence around the processing site. Such 
modification has been found to facilitate the proper processing of recombinant protein C in transfected cells. 

Efficient cleavage may increase the specific activity of the protein, given that single-chain protein C is not 
known to be activated in the bloodstream. As previously noted, this maturation process involves the removal 

50 of the dipeptide Lys-Arg (amino acids I56-I57) (Foster and Davie, Proc. Natl. Acad. Sci. USA 8l: 4766-4770, 
I984). Modifications to the amino acid sequence in the vicinity of this processing site include the substitution 
and/or insertion of amino acids. One such group of modifications is the alteration of the amino acid sequence 
to include the sequence (Ri) n -R2-R3-R4. wherein Ri, R2, R3 and R 4 are Lys or Arg and n«0, 1, 2 or 3, in place 
of the native Lys-Arg dipeptide. A particularly preferred modification of this group is the sequence 

55 Arg-Arg-Lys-Arg. This sequence has been found to enhance processing of recombinant protein C by about 
five-fold in transfected BHK cells. A second group of modifications includes the substitution of amino acid 
residue I54 (His) of native protein C with a basic amino acid residue (i.e., Lys or Arg) to give a processing site 
sequence of the general formula R1-X-R2-R3, wherein Ri. R2, and R3 are Lys or Arg, and X is an amino acid 
other than Lys or Arg, preferably Leu. A third group of modifications includes substitution of the Asp residue at 

60 position 158 with a non-acidic amino acid residue. Use of a small neutral amino acid, such as Ala, Ser, Thr or Gly 
is preferred. A fourth group of modifications includes the substitution of Lys-Lys or Arg-Arg for the Lys-Arg of 
native protein C. Combinations of these groups of modifications may also be made. For example, amino acid 
158 may be substituted in a protein C molecule containing a processing site having the sequence 
(Ri)n-R2-R3-R4. These modifications can be used in producing wild-type protein C or activated protein C. 

65 The coding sequence for protein C or activated protein C is then inserted into a suitable expression vector 
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In " !. ,ransfect a "mammalian cell line. Expression vectors for use in carrying out the present 
mlrn^Ln T™f 8 * "T*™ * tiWBCtin9 the ^"script™ of a foreign gene introduced into a 

mammalian cell. Viral promoters are preferred due to their efficiency in directing transcription A particularly 

conS ^ZT^a th , 6 ma l° r f Pr ° m0ter fr0m adenovi ™ 2- Such expression vector X^X2 % 

to thTn^!, ?r ~n f P Slte f.K° C fI ed downstream from the P™°ter and upstream from the insertion site 5 

C «*»noe °' within the protein C sequence itself. Preferred RNA splice sites may be obtained 
«Zl?ZT* l °J lmmu "°9 |obuli " OM* Also contained in the expression vectors is a polyadenylation 
S^nM rfT °J ^ inSerti ° n Slte - ^ Polyadenylation signaJs are particularly preferred! such 
rpo „ in^ ? te 1 P° | y ade "y ,a « on signals from SV40 or the polyadenylation signal from the adenovirus 5 Elb 
1*™™ \ P h Pji eferred embodiment, the expression vector also comprises a noncoding viral leader 10 

pX4d vector^ ^ 6nOW 7 S , 2 T"' '° CatBd betW6en the pr ° moter and the splice sites. 

J^Z r. * ? ^ , ' nClUde enhancer sequences, such as the SV40 enhancer and the sequences 
encoding the adenovirus VA RNAs. »»huoiiw» 

nhnLXl 9 ^! 1 !. 8 !^!" 0 ^ T "IS introduoed int ° cultured mammalian cells by. for example, calcium 

? «nS h ,no." m . ed L ated transfectlon (Wigleret al.. Cell 14: 725. 1978; Corsaro and Pearson. Somatic Cell Genetics f5 

Qr ?* m and der Eb. Virology 52: 456. 1973). A precipitate is formed o f the DNAand calcium 
5,« ^n* 1 I' LP reoi P ,tate ,s Wl'ed to the cells. Some of the cells take up the DNA and maintain it inside 
LrLr n.?^ H ayS Ismail fraction of these cells (typically 10-4, integrate the DNA into the genome In 
order to dentrfy these integrants, a gene that confers a selectable phenotype (a selectable marker) is 
generally .ntroduced into the cells along with the gene of interest. Preferred selectable markers include genes 
hat confer res.stance to drugs, such as neomycin, hygromycin. and methotrexate. Selectable markers may be 
n reduced into the cell on a separate plasmid at the same time as the gene of interest, or they may be 
undtrZtnnt*! ^'"f^ « on the same the and the gene of Interest 7Iy oe 

E£h ™ thT ° f drff ° re , nt ^ ot P oters ° r same promoter. In one embodiment, the selectable marker is 
placed on the same plasmid with the sequence encoding protein C such that both sequences are controlled bv 
the same promoter, an arrangement known as a dicistronic message. Constructs of this type are known in the 
art (for example European Patent Office publication 117.058). It may also be advantageous to add additional 

th« hmaT 38 Ca T r D , NA< t0 the miXtUre Wh,Ch is **«■«»•<» '"to the cells. After the cells have taken up 
the DNA. they are allowed to grow for a period of time, typically 1-2 days, to begin expressing the gene of 

'r^r i„ 9 t S K e t°K S ^ 8PP,ied t0 SB,eCt tor the srowtn of cells which are" expressing the sePectebS 
marker in a stable fashion. Clones of such cells may be screened for expression of protein C 

Preferred mammalian cell lines for use in the present invention include the COS, BHK and 293 cell lines In 

{ £?»T^ H zm? % S S'SST lWer (ATCC cc ^ L DUKX ce,,s ^-5SK ss 

ttJUHS^^JH^ iA J CC CRL l573: Q, ; aham St aJ - J Gen Virol - 5§ : 59 - 72 - '977) is particularly preferred, due to 
its ability to efficiently process protein C to the two-chain form. This cell line is transformed with human 

wfth^TpN^-^f T th6 , Ad5 HA 98ne inte9rated into its 9 enome - ^ferred expression vectors for use 
with 293 cells will include an adenovirus promoter. Neomycin resistance is a preferred selectable marker for 

EffS'S S£!K£ ^ ^ *" Ce " ' ine BHK57 ° (Wa8Chter and Baserga.PrSa" 

^^ e = ( ; t ^ Py n ^Umbe^ ^ 0, the J,u te J 9 ^ d gene se 9"ance may be increased through amplification by using certain 
, ♦ H 8rS L 6 - 9 ;* d ^ drofolate reductase, which confers resistance to methotrexate). The selectee 

™nti °K d ,nt ° the Ce " S al0n9 *"* thS 9ene of interest - dnj 9 selection is applied. The drug 

t ' S the " lncreased ln 3 stepwise manner, with selection of resistant cells at each step By 

subsSiiS '^cr e eased COPy * SeqUences ' expression levels of the encoded protein may be 

n SlT™<° pr0duc * d according to the present invention is preferably purffled. as by affinity chromatography 
on an anti-prote.n C ent.body column. Additional purification of the column eluate may be achieved by 
conventional chemical purification means, such as high-performance liquid chromatography (HPLC) 

P roduoed . according to the present invention may be activated by removal of the activation peptide 
n2 ,.nHh ter ™ nus ° f , the heavy chain. Activation may be achieved by incubating protein C in the 
presence of a-thrombin (Marlar et al.. BJood 59: 1067-1072. 1982). trypsin (Mariar et al.. ibid.). Russell's viper 

(AmericS; ^ Diagno a sticT r (KiSie '" ^ *" COmmerci ^ avai,able venom-derived aerator Prot£ C 55 

h,,m J nT^n = * ex * m ?° s f0,| P w ' Sample I describes the cloning of DNA sequences encoding 

human protein C. Example 2 descnbes thfe construction of a full-length coding sequence for protein C from the 

hZ^^TE m , ^ ^ the P rodootion ct P ro tein C using transfected mammalian cells. Example 5 so 
Z^lt a ful, - ,e "9* cDNA encoding protein C and its expression in transfected mammalian cells. Example 6 

orot/n r i „ Pr ° dUCtl0n ° f aCtivated pratein C in BUK ^ 293 ce,ls - Example 7 describes the production of 
protein C from a precursor having a modified cleavage site. Example 8 describes the use of the Factor VII 
pre-pro peptide or the prothrombin pre-pro peptide to direct the secretion of protein C from transfected cells. 

• as 
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EXAMPLES 

Restriction endonucleases and other DNA modification enzymes (e.g., T4 polynucleotide kinase, calf 
alkaline phosphatase, Klenow DNA polymerase. T4 polynucleotide ligase) were obtained from Bethesda 
Research. Laboratories (BRL) and New England Biolabs and were used as directed by the manufacturer, 
5 unless otherwise noted. 

Oligonucleotides may be synthesized on an Applied Biosystems Model 380 A DNA synthesizer and purified 
by polyacryiamide gel electrophoresis on denaturing gels. E. coli cells may be transformed as described by 
Maniatis et al. (Molecular Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory, 1982). MI3 and pUC 
cloning vectors and host strains were obtained from BRL. 

10 

Example I 

Cloning of DNA Sequences Encoding Human Protein C 
A cDNA coding for a portion of human protein C was prepared as described by Foster and Davie (ibid.). 

15 Briefly, a Xgtll cDNA library was prepared from human liver mRNA by conventional methods. Clones were 
screened using 125 l-Iabeled affinity-purified antibody to human protein C, and phage were prepared from 
positive clones by the plate lysate method (Maniatis et al., ibid.), followed by banding on a cesium chloride 
gradient. The cDNA inserts were removed using Eco Rl and subcloned into plasmid pUC9 (Vieiraand Messing, 
Gene I9: 259-268, I982). Restriction fragments were subcloned in the phage vectors MI3mplO and MI3mpH 

20 (Messing, Meth. in Enzymology IQI: 20-77, 1983) and sequenced by the dideoxy method (Sanger et al., Proc. 
Natl. Acad. Sci. USA 74: 5463-5467, 1977). A clone was selected which contained DNA corresponding to the 
known partial sequence of human protein C (Kisiel, ibid.) and encoded protein C beginning at amino acid 64 of 
the light chain and extending through the heavy chain and into the 3' non-coding region. This clone was 
designated XHCI375. A second cDNA clone coding for protein C from amino acid 24 was identified. The insert 

25 from this clone was subcloned into pUC9 and the plasmid designated pHC)u6L (Figure I). This clone encodes a 
major portion of protein C, including the heavy chain coding region, termination codon, and 3' non-coding 
region. 

The cDNA insert from XHCI375 was nick translated using a- 32 p dNTp's and used to probe a human genomic 
library in phage X Charon 4A (Maniatis et al.. Cell 15: 687-702. 1978) using the plaque hybridization procedure of 
30 Benton and Davis (Science I96 : I8M82, I977) as modified by Woo ( Meth. in Enzymology 68: 38I-395, I979). 
Positive clones were isolated and plaque-purified (Foster et al , Proc. Natl. Acad. Sci. USA 82: 4673-4677, 1985, 
herein incorporated by reference). Phage DNA prepared from positive clones (Silhavy et al., in Experiments 
with Gene Fusion , Cold Spring Harbor Laboratory, I984) was digested with Eco Rl or Bgl II and the genomic 
inserts purified and subcloned in pUC9. Insert restriction fragments were subcloned into MI3 vectors and 
35 sequenced to confirm their identity and establish the DNA sequence of the entire gene. 

The cDNA insert of pHCX6L was nick translated and used to probe the phage X Charon 4A library. One 
genomic clone was identified which hybridized to probes made from the 5' and 3' ends of the cDNA. This 
- phage clone was digested with Eco Rl and a 4.4 kb fragment, corresponding to the 5' end of the protein C 
gene, was subcloned into pUC9. The resultant recombinant plasmid was designated pHCR4.4. Complete DNA 
40 sequence analysis revealed that the insert in pHCR4.4 comprised two exons of 70 and I67 base pairs 
separated by an intron of 1263 bp. The first exon encodes amino acids -42 to -19; the second encodes amino 
acids -19 to 37. Sequence analysis confirmed the DNA sequence of the entire protein C gene. 

As noted above, It is then necessary to remove the intron in order to use a genomic clone to construct an 
acceptable coding sequence for use within the present invention. 

45 

' Example 2 

Construction of a Full-Length Coding Sequence for Protein C 

A full-length coding sequence for Protein C, including the pre-pro peptide, is constructed by joining the 
50 appropriate fragments of the cDNA and genomic clones. This is accomplished by removing the intron from the 
genomic clone (pHCR4.4) and joining the fused exons to the cDNA (from pHCX6L) at convenient restriction 
sites. The desired genomic :cDN A junction is then generated by looping out unwanted sequences by 
oligonucleotide-directed deletion mutagenesis. 

Plasmid pHCXSL contains the protein C partial cDNA cloned in the Eco Rl site of pUC9 (Figure I). The cDNA 
55 insert is subcloned in two fragments to prepare it for joining to the 5'-most coding region from the genomic 
clone. Plasmid pHCX6L is digested with Eco Rl and Sal I, and the reaction mixture fs extracted with phenol and 
CHCI3, then ethanol-precipitated. The resulting DNA fragments are resuspended in ligation buffer, and T4 DNA 
ligase is added. The ligation mixture is incubated at I5°C for 14 hours. An aliquot of the ligation mix is used to 
transform E. coli JM83, and the cells are plated on LB agar containing X-gal. White colonies are selected, and 
60 plasmid DNA is prepared. The DNA is analyzed by restriction enzyme digestion to identify clones containing 
the 3' portion of the cDNA (ca. I450 bp insert) and the 5' portion of the cDNA (ca. 65 bp insert). These clones 
are designated p9C3' and p9C5', respectively (Figure 6). 

The 5' coding region missing from the cDNA is contained in exons I and II of the genomic clone pHCR4.4. 
This plasmid contains an insert of approximately 4400 base pairs and terminates on its 3' end at an Eco Rl site 
65 located in intron B. 
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r Jumn^J^ntfJl" 8 Sequ f " c t S fr ,° m P H< =R4.4. the plasmid is digested with Pstl and Eco Rl and the 
resulting fragments are separated by electrophoresis in an agarose gel. The ca2540 bp fragment containing 

EST™ flman ^ £ ^ ^ ° TAB « Ana£t. slcH^ 

I980). Ths fragment, des.gnated 5'P-R. is subcloned into P UC9 to produce pla smid p5'P-R (figu red) 

din^d wrTh a ^ k- ( K S f 9nat6d intr ° n A) ' iS removed in a two - ste P P roces * <R B u™ 7). The^md is 5 
P S II TtheJfreateS'^h 'IT,? 6 ^ I" intr ° n and ,eaVes 3 ' ^erhakging ends. The linearized 

«nri Tnn Zl r n f ? Ba ' 31 exonuclea se or T 4 polymer ase to remove approximately 400 bp from each 
tn a « „„J u£ ♦ T f, ! 9me 2 t 6ndS b,Unted with Sl nuc,ease - 1116 linearized plasmid is ^circularized w£h 
ano^st.^ 

chosen ISTSSSSSST! PS'pE * ^ 3 ^ ^ a ^ fra9ment redUCed te b P ia » 

The remainder of intron A is removed by oligonucleotide-directed deletion mutagenesis essentially as 
descnbed by Zoller and Smith ( Manual for Advanced Techniques in Molecular Clo ning Course Cold Scri™ 
Harbor Laboratory 1983) for the two-primer method. pS'PAaR is digested with Pst I and Eco Z Fin d Ith proten 
P «J + E °° m «*«™ MI3m P9- P't» strand phage DNA is prepaid as 3p ate 
and annealed to ol.gonucleotlde mut-l (Table I). This mutagenic oligonucleotide comprises sequences 
complementary to the exon I and II sequences to be joined. The MI3 universal sequencing prfmerTs aSeS 3* 
to mut- on the same template. The primers are extended using DNA polymerase I (Kleno^ f^enfaS 

^STXSSai^ ^ ST ° f T4 ° NA " 9aSe - ^ res "'«ng ilex DNA circ.es !SSSS£S 
32n ll^m^non r resulting plaques screened under stringent hybridization conditions using the 
,7 ^ ol£onucleot.de as probe. DNA from positive plaques is isolated and sequenced us nq 
a™ .fZ D H d ^p^„ mer ^ f^ab, ! Y WCh PrimeS ln SXOn "• a "° win9 the d *armination of the D^A sequence 

Ps« I JS SESJ ?^° n ,\ A ^ CUl l haVi " g thS COrreCt inframe fuaion ° f exons 1 ™ d '' * selected. ^ 
Pstl-EcoRI fragment is .solated from the MI3 replicative form by restriction endonuclease digestion and 
agarose gel electrophoresis and is subcloned Into pUC9 to produce plasmid P 5'l-ll (Figure 7) 

F ' 9 o rB *■ tD j0in 5 ' ° odin9 re 9 |on to tne cDNA - ,he ca 1277 bp Pst l-Eco Rl fragment of d5'I-II 
■s .sola tec I from a Pst I + Eco Rl digest of the plasmid and purified by agarose gel electropho^lsTs Ae K bo 
1 rZ°££ l S ' SOlated fr ° m a Sa " + Eco Rl digest of p9C5'and purified by eteS^horoto on an 

" k ^ ^° * a9ments liaated at «"* Eco Rl termini, and the resulting ca. 1335 bp Pst 1-SaH 
suboloned «*» P * 1 + Sal l-digested MI3mp9 (Figure 8). Plus strand phage DNA is prepared as 
pmn 3 ! fo ;°"9 onucle< ' t ' d f -erected deletion mutagenesis. Oligonucleotide mut-2 (Table I) is annealed , tto 
template and ol.gonucleotlde mut-3 (Table I) is annealed upstream as second primer The primers are 
ac d\ 2 £26^?h C e n c b nMf °t Ve ' ™S 0 " u l eOt - ide ™^ dir -ts the fusion of exon I. sequences encoTg anlo 
frnm thf^^f . , £ x^" ^ ^ Se °° nd primer (mut - 3) introduces an Eco Rl site 35 bp upstream 
^L^^T^P^T Th ,V eSU i' "? PhS9e m SCreened forthe absence of Nc ° 1 *" d Xho I sites andTr 
Sno orimer^ ^hteTt^lfSf ™ S ' te - Pha9e ° NA Sh ° Wing the desired restriction pattern is sequenced 
DNA thl ™m " ^ the P resence °* c °rract junction between exon II and the cDNA. Phage 
i^oit^ fror 6 = orrect , seduen ? e ls s f ected. and the Pst l-Sal I fragment comprising the 5' coding region is 
.solated from the repl.cat.ve form of the MIS recombinant phage. The fragment is purified by agarose oel 
electrophores.s and inserted into Pst I + Sal l-digested pUC9 to produce plasmid pCS'end 9 
Referring to F.gure 9, plasmid pC5'end is digested with EcoRI and Sal I. and the 5' protein C fraoment is 

San ,e Eco y rTS f e,eCtr °o P ^ S 0nd BXtraCti ° n With CTAB - ^ remainder of thecDNA is isSd aL a 
oui" £1 fJr 9 i ?9C l ' 17,6 ^ fra9men ts are joined in a three-part ligation to Eco Rl- digested 
pUC9. The ,gat,on m.xture .s used to transform E^coli JM83. the cells are plated on LB + X-gal. and plasmid 
DNA ,s .solated from whrte colonies. The resultant plasmid is designated pMMC. It contains he coS e 45 
codmg sequence for human protein C on a ca. 1500 bp Eco Rl fragment complete 45 

TABLE I 

Oligonucleot ide ~ S equence 50 

mut_1 3'CGA CGA GAA CTG AGT CAC AA5 ' 

mut-2 3 ' CTG AAG CTC CTC CGG TTC CTT TAA5 ' 55 

mUt ~ 3 5< GGA GGA ATT CTG AGC^ ' 

primer-1 5' TTT GCG- GAT CCG CAG^ ' 

primer-2 S' CGA CGT* GCT TGG ACC^ ' * 
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Example 3 

Constructio n of Expression Vectors for Protein C 

The protein Oencoding insert is removed from pMMC as an Eco Rl fragment and inserted into a suitable 
mammalian ceil expression vector. An exemplary vector is pD7, comprising the SV40 enhancer and the 
adenovirus 2 major late promoter and tripartite leader. 

Plasmid pD7 is generated from plasmid pDHFRIU (Berkner and Sharp, Nuc. Acids. Res. 13: 84I-857, I9B5). 
The Pst I site immediately upstream from the DHFR sequence in pDHFRIU was converted to a Bel I site by 
digesting I0 ug of plasmid with 5 units of Pst I for 10' at 37* C in 100 ul buffer A (10 mM Tris pH 8. 10 mM MgCl 2 . 6 
mM NaCI 7mM p-MSH). The DNA was phenol extracted, EtOH precipitated, and resuspended in 40 ul buffer B 
(50 mM Tris pH 8, 7 mM MgC! 2 , 7mM p-MSH) containing I0 mM dCTP and IB units T 4 DNA polymerase and 
incubated at I2°C for 60 minutes. Following EtOH precipitation, the DNA was ligated to 2.5 ug kmased Bel I 
linkers in 14 ul buffer C (10 mM Tris pH 8, 10 mM MgCI 2 . I mM DTT, L4 mM ATP) containing 400 units T 4 
polynucleotide ligase for 12 hours at 12°C. Following phenol extraction and EtOH precipitation, the DNA was 
resuspended in 120 ul buffer D (75 mM KCI, 6 mM Tris pH 7.5, 10 mM MgCla. I mM DTT) , digested with 80 units 
Bel I for 60 minutes at 50°C, then electrophoresed through agarose. Form til plasmid DNA (10 ug) was isolated 
from the gel, and ligated in 10 ul buffer C containing 50 units T 4 polynucleotide ligase for 2 hours at 12° C, and 
used to transform E. coli HBIOI. Positive colonies were identified by rapid DNA preparation analysis, and 
plasmid DNA (designated pDHFR') prepared from positive colonies was transformed into dAM- E. coli. 
) Plasmid pD2' was then generated by cleaving P DHFR'(I5 ug) and pSV40 (comprising Bam HI digested SV40 
DNA cloned into the Bam HI site op pML-l) (25 ug) in 100 ul buffer D with 25 units Bel I for 60 minutes at 50 C. 
followed by the addition of 50 units Bam HI and additional incubation at 37° C for 60 minutes. DNA fragments 
were resolved by agarose gel electrophoresis, and-the 4.9 kb pDHFR' fragment and 0.2 kb SV40 fragment were 
isolated. These fragments (200 ng pDHFR' DNA and 100 ng SV40 DNA) were incubated in 10 ul buffer C 
> containing 100 units T4 polynucleotide ligase for 4 hours at 12° C, and the resulting construct (pD2') was used to 
transform E. coll RRL _ , • 

Plasmid pD2' was modified by deleting the "poison" sequences in the pBR322 region (Lusky and Botchan, 
Nature 293: 79-81, 1981). Plasmids pD2' (6.6 ug) and pML-l (Lusky and Botchan, ibid.) (4 ug) were incubated in 
50 ul buffer A with 10 units each Eco Rl and Nru I for 2 hours at 37° C, followed by agarose gel electrophoresis. 
30 The 1.7 kb pD2' fragment and 1.8 kb pML-l fragment were isolated and ligated together (50 ng each) in 20 ul 
buffer C containing 100 units T 4 polynucleotide ligase for 2 hours at 12° C, followed by transformation into E. coll 
HBI01 Colonies containing the desired construct (designated pD2) were identified by rapid preparation 
analysis. Ten ug of pD2 were then digested with 20 units each Eco Rl and Bgl II, in 50 ul buffer A for 2 hours at 
37° C. The DNA was electrophoresed through agarose, and the desired 2.8 kb fragment (fragment C) 
35 comprising the pBR322, 3' splice site and poly A sequences was isolated. 

To generate the remaining fragments used in constructing pD3, pDHFRIU was modified to convert the Sac 11 
(Sst II) site into either a Hind Ili or Kpn I site. Ten ug pDHFRIU were digested with 20 units Sst II for 2 hours at 
37° C, followed by phenol extraction and ethanol precipitation. Resuspended DNA was incubated in 100 ul 
buffer B containing 10 mM dCTP and 16 units T 4 DNA polymerase for 60 minutes at 12° C, phenol extracted, 
40 dialyzed and ethanol precipitated. DNA (5 ug) was ligated with 50 ng kinased Hind HI or Kpn I linkers in 20 ul 
buffer C containing 400 units T 4 DNA ligase for 10 hours at 12° C, phenol extracted, and ethanol precipitated. 
After resuspension in 50 ul buffer A, the resultant plasmids were digested with 50 units Hind III or Kpn I, as 
appropriate, and electrophoresed through agarose. Gel-isolated DNA (250 ng) was ligated in 30 ul buffer C 
containing 400 units T 4 DNA ligase for 4 hours at I2°C and used to transform E. coli RRI. The resultant plasmids 
45 were designated pDHFRIU (Hind III) and pDHFRIU (Kpn I). A 700 bp Kpn l-Bgl II fragment (fragment A) was then 
purified from pDHFRIU (Kpn I) by digestion with Bgl H and Kpn I followed by agarose gel electrophoresis. 

The SV40 enhancer sequence was inserted into pDHFRIU (Hind III) as follows: 50 ug SV40 DNA was 
incubated in 120 ul buffer A with 50 units Hind III for 2 hours at 37° C, and the Hind III C SV40 fragment (5089-968 
bp) was gel purified. Plasmid pDHFRIU (Hind HI) (10 ug) was treated with 250 ng calf intestinal phosphatase for I 
50 hour at 37° C phenol extracted and ethanol precipitated. The linearized plasmid (50 ng) was ligated with 250 ng 
Hind III C SV40 in 16 ul buffer C for 3 hours at 12° C, using 200 units T 4 polynucleotide ligase. and transformed 
into E. coli HBIOI. A 700 base pair Eco Rl-Kpn I fragment (fragment B) was then isolated from this plasmid. 

For the final construction of pD3, fragments A and B (50 ng each) were ligated with 10 ng fragment C with 200 
units T 4 polynucleotide ligase for 4 hours at 12° C, followed by transfection of E.coli RRI. Positive colonies were 
55 detected by rapid preparation analysis, and a large-scale preparation of pD3 was made. 

Plasmid pD3 is modified to accept the insertion of the protein C sequence by converting the Bel I insertion 
site to an Eco Rl site. It is first necessary to remove the Eco Rl site present in pD3 at the leftmost terminus of 
the adenovirus 5 0-1 map unit sequences by converting it to a Bam HI site via conventional linkenng 
procedures. Briefly, the plasmid is digested with Eco Rl and the linearized DNA treated with T 4 DNA 
60 polymerase and all four deoxynucleotide triphosphates to generate blunt termini. The plasmid is then ligated to 
octonucleotides comprising the Bam HI restriction site, the DNA digested with Bam HI to remove excess 
linkers and the fragment comprising the mammalian cell expression sequences is cloned into the Bam HI site 
of pML-l The resultant plasmid is transformed into E. coli HBIOI, and plasmid DNA is prepared and screened 
for the correct conversion, in a similar manner, the Bel I site is converted to an Eco Rl site using appropriate 
65 octonucleotide linkers. The resultant vector is known as pD7. The 1.5 kb protein C Eco Rl fragment from pMMC 
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is then inserted into the Eco Rl site of pD7 to produce the expression vector pD7C (Figure 10) 
A vector enabling expression of the protein C sequence from a polycistronic message is constructed bv 

using pD5, a plasmid similar to pD3 which contains a DHFR coding sequence lacking most of the 5' 

non-coding region. The DHFR sequence is further modified to reduce its binding affinity to methotrexate 
The vector P D5 is constructed by a method analogous to that described for pD3, and differs from P D3 only in s 

that a Bam HI site is the site of insertion of heterologous DNAs. and that the BC l-Bam: HI SV40 fragment 

containing the SV40 polyadenylation signal is in the late orientation. 

n J™ ? HFR : n °. e iS rT10dmed bv Aral digesting pDHFRIII with Pst I and Sst I and isolating the 400 bp 
DHFR fragment. This is subcloned in an MI3 phage vector and mutagenized as described by Simohsen and 

£ rSS? £ 5* ^ A ~ : 2495 ' 24 "' ' 983) - Muta 9 enesl * in a single base pair S££?Z m 

%Z „ sequence. The altered fragment is then reinserted into pDHFRIII to produce plasmid pDHFRrlll 
aJ, t non - codln 9 re 9' on of tne DHFR sequence is then removed. Plasmid pDHFRrlll is cleaved with Fnu 
4HI. which cuts the plasmid at approximately 20 sites, then treated with T 4 DNA polymerase and all four 
deoxynucleotide triphosphates to generate blunt termini. Bam HI linkers are llgated to the ends and the 
mixture digested with Bam HI and Nco I. A 0.6 kb Bam Hl-Nco I fragment comprising the DHFRr' oDNA is 
isolated. Plasmid pDHFRIII is digested with Nco I and Bam HI and the 0.2 kb fragment comprising the SV40 
polyadenylation signal is isolated. The polyadenylation signal, in the early orientation, is then ligeted to the 
DHFRr fragment After digestion with Bam HI, the resultant Bam HI fragment is then inserted into the Bam HI 
site of pD5 and the ligation mixture used to transform E. coli HBI0I. Plasmid DNA is prepared and screened bv 
restriction endonuclease digestion. A plasmid having the DHFRr insert in the correct orientation for 
transcnption from the Ad2 major late promoter is designated pD5(DHFRr) 

To express protein C using plasmid pD5(DHFRr), pMMC Is digested with Eco Rl and the 1 5 kb protein C 
fragment is isolated. The Eco Rl termini are converted to Bel I termini by linkering Plasmid pD5(DHFRr) is 
partially digested with Bam HI to cleave it at the 5' and of the DHFRr sequence and is Iigated to the protein C 
fragment. Plasmid DNA is screened for the proper orientation and insertion of the protein C fragment The 25 
resultant vector, designated pD5(PC-DHFRr). is illustrated in Figure II. ^ 25 

Example 4 

Expression of Protein C in Transfected Mammalian Cells 

Baby hamster kidney (BHK) cells (American Type Culture Collection accession number CCU0) are 
transfected with pD7C by calcium phosphate co-precipitation (Wigler et al . Cell 14: 725 I978- Corsaro and 
Pearson, Somatic Cell Genetics 7: 603, 198I; and Graham and Van der Eb. Virolog 752: 456 I973)' The cells are 
9 ^ 8 ! 3 /°°- 50/0 ?° 2 in Dulbecco ' s medium (plus lOO/o heat-inactivated fetal caifierum and supplemented 
with L-glutamine and penicillin-streptomycin) in 60 mm tissue culture Petri dishes to a confluency of 20% A 
total of 10 ug of DNA is used to transfect one 60 mm dish: 3.75 ug of pD7C, I.25 ug of pKO-neo (Southern and 
Berg, ^AppUGenet 1: 327-341. 1982) and 5 ug of salmon spe'rm DNA. The DNAs are pS*S2SS£2 
NaOAc, 750/o ethanol. rinsed with 70% ethanol and redissolved in 20 ul 10 mM Tris-HCI pH8. 1 mM EDTA The 

7 io c° m , 8d f ^ ~ 2 ° a " d 500 ° f 280 mM NaCI - 15 mM NaHPO*. 12 mM dextrose, 50 mM HEPES 
pH 7.12. Sixty ul of 250 mM CaCI 2 are added dropwfse to the above mixture and the solution is allowed to stand 
at roomtemperature for 30 minutes. The solution is then added to the cells and the cells returned to 37" C for 4 
hours. The medium is removed and 5 ml of 20°/o DMSO in Dubecco's with serum are added for 2 minutes at 
room temperature The dish is then washed rapidly with 2 changes of medium and incubated in fresh medium 
overnight. Twenty-four hours after the addition of the DNA. the medium is removed and selective medium (10 
mg/ml of G4I8. 498 u/mg, G.bco, in Dulbecco's with serum) added. After approximately 10-13 days, individual 
clones, representing cells that have incorporated the pKO-neo gene and are thus resistant to G418 are 
transferred to 96-well plates and grown up for protein assays in Dulbecco's plus 10% fetal celf serum 

To assay for protein C. the medium is separated from the cells and cellular debris by centrifugation and 
assayed for protein C polypeptide and biological activity. The cells are removed from the plates with trypsin 

. « w,tt \ fresn medium centrifuged and frozen at -20°C. For assay, the cell pellets are thawed in PBs' so 
pe leted and resuspended in PBS containing 0.25o/o Triton X-100. Samples are diluted and assayed for 
polypeptide and activity. ' or 

The enzyme-linked immunosorbent assay (EUSA) for protein C is done as follows: Affinity-purified 
polyclonal an .body to human protein C (100 ul of I ug/ml in 0.1 M Na 2 C0 3 , pH 9.6), is added to each well of 
^tTonc « if P ,ates u and * n e Plates are incubated overnight at 4°C. The wells are then washed three times 55 
™ i u Phosphate buffer. pH 7.5. 0.15 M NaCI) containing 0.050/o Tween-20 and then incubated with 
£L!^To 0 °e ,ne !! m T aJb " min ' 0 05C/b Tw6en 20 in PBS at 4 ° c overnight. The plates are then rinsed several 
^ ?k ?, ' St0r8d at 4 ° C " To ***** sam P'es. 100 ul of each sample is incubated for I hour at 

,L . h« • ^ -T? We " S "* rinSed W " h 0 050/0 Tween - 20 in PBS - Toe P"ates are then incubated 

for I hour at 37 C with a biot.n-conjugated sheep polyclonal antibody to protein C (30 ng/ml) in PBS containing so 
10/0 bovine serum albumin and O.OSo/o Tween-20. The wells are rinsed with PBS and incubated again for I hour at 
37 C with avidin conjugated to aJkaiine phosphatase in PBS containing K>/ 0 bovine serum albumin and 0 050/0 
Tween-20. The wells are then rinsed with PBS, and alkaline phosphatase activity is measured by the addition of 
100 ul of phosphatase substrate (Sigma 104; 600 ug/ml) in 10% dietnanolamine. pH 9.8, containing 0 3 mM 
MgCI 2 . The absorbance at 405 nm is read on a microtiter plate reader S5 
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Protein C biological activity is assayed by its ability to prolong the kaolin-cephalin clotting time of plasma 
following rts activation as described in Example 5C. 

Example 5 

5 

Expression of a Full Length cDNA Encoding Protein C 
A. Isolation of cDNA . 

A genomic fragment containing an exon corresponding to amino acids -42 to -19 of the pre-pro peptide 
10 (Exon I in Figure 4) of protein C was isolated, nick translated, and used as a probe to screen a cDNA library 
constructed by the technique of Gubler and Hoffman ( Gene 25: 263-269. 1983) using mRNA from HEPG2 cells. 
This cell line was derived from human hepatocytes and was previously shown to synthesize protein C (Fair and 
Bahnak, Blood 64: 194-204, 1984). Ten positive clones comprising cDNA inserted into the Eco Rl site of phage 
Xgtil were isolated and screened with an oligonucleotide probe corresponding to the 5' non-coding region of 
15 the protein C gene. One clone was also positive with this probe and its entire nucleotide sequence was 
determined. The cDNA contained 70 bp of 5' untranslated sequence, the entire coding sequence for human 
prepro-protein C, and the entire 3' non-coding region corresponding to the second polyadenylation site 
(Figure 2). 

20 B. Expression Vector Construction . 

The expression of protein C cDNA was achieved in the vector pDX. This vector was derived from pD3 
(described in Example 3 above) and pD3' f a vector identical to pD3 except that the SV40 polyadenylation signal 
(i.e., the SV40 BamHl [2533 bp] to Bel I [2770 bp] fragment) is in the late orientation. Thus, pD3' contains a Bam 
HI site as the site of gene insertion. 

25 To generate pDX, the Eco Rl site in pD3' was converted to a Bel I site by Eco Rl cleavage, incubation with SI 
nuclease, and subsequent ligation with Bel I linkers. DNA was prepared from a positively identified colony, and 
the I.9 kb Xho l-Pst I fragment containing the altered restriction site was prepared via agarose gel 
electrophoresis. In a second modification. BCI l-cleaved pD3 was ligated with kinased Eco Rl-Bcl I adaptors 
(constructed from oligonucleotides ZC525, 5'GGAATTC3'; and ZC526, 5'GATCAGAATTCC3') in order to 

30 generate an Eco Rl site as the position for inserting a gene into the expression vector. Positive colonies were 
identified by restriction endonuclease analysis, and DNA from this was used to isolate a 2.3 kb Xho l-Pst I 
fragment containing the modified restriction site. The two above-described DNA fragments were incubated 
together with T* DNA ligase, transformed int o E. coii HB101 and positive colonies were identified by restriction 
analysis. A preparation of such DNA, termed pDX, was then made. This plasmid contains a unique Eco Rl site 

35 for insertion of foreign genes. 

The protein C cDNA was then inserted into pDX as an Eco Rl fragment. Recombinant ptasmids were 
screened by restriction analysis to identify those having the protein C insert in the correct orientation with 
respect to the promoter elements and plasmid DNA (designated pDX/PC) was prepared from a correct clone 
(Figure 12). Because the cDNA insert in pDX/PC contains an ATG codon in the 5' non-coding region (see 

40 Figure 2), deletion mutagenesis was performed on the cDNA prior to transection and expression 
experiments. Deletion of the three base pairs was performed according to standard procedures of 
oligonucleotide-directed mutagenesis. The pDX-based vector containing the modified cDNA was designated 
p594. . 

45 C. cDNA Expression . 

Plasmid p594 was-transfected into COS-! (ATCC CRU650), BHK and 293 cells by calcium phosphate 
precipitation. Four hours later, fresh culture media (supplemented with 5 ug/ml vitamin K) were added. At 
appropriate times (usually 48 or 72 hours), the culture media were harvested and the cells were collected and 
lysed. 

50 The protein C secreted into the culture medium was assayed by ELISA using the same affinity-purified 
polyclonal antibody which was used in the initial identification of the cDNA clones. Results of the assays of 
COS-I cells. 

To assess the extent of gamma-carboxyiation of the recombinant protein, samples of the culture media 
were subjected to barium citrate precipitation, a process which selectively precipitates only gamma-carboxy- 

55 lated proteins from plasma (Bajaj et al., J. Biol. Chem. 256 : 253-259, 1981). Over 70<Vo of the protein C antigenic 
material could be precipitated with barium citrate. 

The recombinant protein C was assayed for anti-coagulant activity by measuring its ability to prolong 
coagulation. Dialyzed media samples were treated with Protac C (American Diagnostica) to activate the 
protein C. The samples were then added to an in vitro clotting assay (Sugo et al., J. Biol. Chem. 260 : 10453, 

60 1985). Briefly, 50 ul each of normal pooled human plasma, rabbit brain cephalin (10 mg/ml in TBS [50mM Tris pH 
7.5, 150 mM NaCI]) and kaolin suspension (5 mg/ml in TBS) were mixed in a siliconized glass tube. After 
preincubation at 37° C for 2 minutes, 100 ul of activated protein C diluted in TBS was added and the 37° C 
incubation was continued for an additional 2 minutes. Clotting was then initiated by the addition of 50 ul of 25 
mM CaCl2. and the clotting time was recorded. The activity of the recombinant material was shown to be 

65 essentially the same as that of naturally occurring protein C. 
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Protein C produced by transfected BHK and 293 cells was further analyzed by Western blotting. Media 
samples were electrophoresed on denaturing gels and blots were prepared and probed with radiolabeled 
antibody to protein C. Results indicated that about 20<>/o of the protein C from BHK ceils was in the two-chain 
form, while about 900/o of that from 293 cells was processed to the two-chain form. 

5 

TABLE 2 

TRANSIENT EXPRESSION AND SECRETION OF PROTEIN C 

IN COS-1 and 293 CELLS w 

ng/ml 
protein C 

cells plasmid in media 15 

COS-1 none 0 

COS-1 P 594 , 10 20 



293 



none 0 



293 p 594 so 



Example 6 



25 



Expression of Activated Protein C 30 

The cDNA sequence for protein C was altered by site-specific mutagenesis to delete the portion encoding 
the activation peptide. The altered sequence was then transfected into BHK and 293 cells and stabiy 
transfected cells were selected. Active protein C was detected in culture media samples from both cell lines. 

To delete the activation peptide coding sequence, plasmid p594 was digested with Sst I and the -880 bp 
fragment was purified and inserted into the Sst I site of MI3mpl0. The 12 activation peptide codons were deleted 
by oligonucleotide-directed deletion mutagenesis (Zo)ler and Smith, DNA 3 : 479-488 1984) using the 
mutagenic oligonucleotide 5'CTGAAACGACTCATTGAT3\ Replicative form DNA was prepared from mutant 
phage clones and digested with Sst I. The protein C fragment ( -840 bp) was isolated and inserted into Sst I 
digested p594. The resultant plasmids were screened for proper orientation of the Sst I fragment by restriction 
mapping using Bgl II A correct plasmid was selected and designated pPC829. Plasmid pPC829 was sequenced 
to verify the presence of the desired coding sequence. 

Plasmid pPC829 was co-transfected into BHK cells (with plasmid pSVDHFR (Lee et ah, Nature 294: 228-232 
1981)) and 293 cells (with pKO-neo (Southern and Berg, J. Mol. Appl. Genet. I: 327-341, I982)7by calcium 
phosphate coprecipitation (Graham and van der Eb, Virology 52: 456-467, 1973). After 48 hours, culture media 
were harvested and assayed for protein C by ELISA. Results are shown in Table 3. At the same time, cultures 
were split 1:5 into media containing 500 ug/mi of G4I8 (293 cells) or 250 nM methotrexate (BHK cells). After 10 
days in the presence of selective media, stably transfected colonies were screened for protein C production by 
immunofilter assay (McCracken and Brown, BioTechniques , 82-87, March/April 1984). Plates were rinsed with 
PBS or No Serum medium (Dulbeccb's plus penicillin-streptomycin, 5 ug/ml vitamin K). Teflon® mesh was then 
placed over the cells. Nitrocellulose, filters were wetted with PBS or No Serum medium, as appropriate, and 
placed over the mesh. After four hours* incubation at 37° C, filters were removed and placed in buffer A (50 mM 
Tris pH 7.4, 5 mM EDTA, 0.05<Vo NP-40, 150 mM NaCI. 0.250/o gelatin) for 30 minutes at room temperature. The 
filters were incubated for I hour at room temperature, with shaking, in biotin-labeied sheep polyclonal antibody 
to protein C, I ug/ml in buffer A. Filters were then washed in buffer A and incubated I hour at room temperature 
with shaking, in avidin-conjugated horseradish peroxidase (Boehringer-Mannheim), 1:1000 in buffer A Filters' 
were washed in buffer B. then in H2O, and incubated in color reagent (60 mg HHP color development reagent 
[Bio-Rad], 20 ml methanol, 100 ul H2O2 in 100 ml 50 mM Tris pH 7.4, 150 mM NaCI). The reaction was stopped by 
transferring the filters to H2O. 

Positive colonies were picked and grown in selective media (containing 500 ug/ml G4I8 or 250 nM 
methotrexate, as appropriate) for 10 days. Culture media were assayed for APC activity by chromogenic assay 
Media samples were added to microtiter wells containing 100 ul of 0.2 mM Spectrozyme PCa (American 
Diagnostica #336) in 50 mM Tris pH 7.5, 150 mM NaCI. Plates were incubated at 37° C and the A 40 6 measured 
at vanous time intervals. Representative results from one transfected 293 cell line (designated 829-20) are 
shown in Figure 13. Media from positive colonies of line 829-20 consistently showed higher activity with the 
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chromogenic substrate for APC than did control media which had been incubated with non-transfected 293 
cells tor the same length of time (10 days). 



10 



TABLE 3 

TRANSIENT EXPRESSION OF ACTIVATED PRtrTEIN C (ELTSA) 

Protein C 

Cell Line nq/ml in Media 



15 



BHK 
293 



2-7 
30 



20 



25 



30 



35 



40 



45 



Example 7 

Modification of the Protein C Processing Site 

A. Site-Specific M utagenesis . . 

To enhance the processing of single-chain protein C to the two-chain form, two additronal arginme residues 
were introduced into the protein, resulting in a cleavage site consisting of four basic amino acids. The resultant 
mutant precursor of protein C was designated PC962. It contains the sequence Ser-His-Leu-Arg-Arg-Lys-Arg- 
Asp at the cleavage site. Processing at the Arg-Asp bond results in a two-chain protein C molecule 

The mutant molecule was generated by altering the cloned cDNA by site-specific mutagenesis (essentially 
as described by Zoller and Smith, DNA 3: 479-488, I984, for the two-primer method) using the ™tagenic 
oligonucleotide ZC962 (5'AGTCACCTGAGAAG AAAACGAGACA3') . Plasmid p594 was digested with Sst I and 
the approximately 87 bp fragment was cloned into MI3mpII and single-stranded template DNA was iso ated. 
Following mutagenesis, a correct clone was identified by sequencing. Replicative form DNA was isolated 
digested with Sst I, and the protein C fragment was inserted into Sst l-cut p594. Clones having the Sst 
fragment inserted in the desired orientation were identified by restriction enzyme mapping. The resulting 
expression vector was designated pDX/PC962 (Figure 14). 

B. Expression and Characteri zation of Protein C. ~ „ 
Plasmid P DX7PC962 was c o-transfected into tk- BHK cells with pSV2-DHFR (Subramani et ah, MoK Ceil 

Biol 1*854-864, 1981) by the calcium phosphate procedure (essentially as described by Graham and van der Eb 
Ibid")" The transfected cells were grown in Dulbecco's modified Eagle's medium (MEM) containing i00/o fetal 
calf serum, IX PSN antibiotic mix (Gibco 600-5640), 2.0 mM L-glutamine and vitamin K (5 ug/ml). The cells were 
selected in 250 nM methotrexate (MTX) for 14 days, and the resulting colonies were screened by the 
immunofilter assay (Example 6). Six of the most intensely reacting colonies were picked by cylinder cloning 
and grown individually in 10-cm plates. When the cultures were nearly confluent, protein C production levels 
were measured by EUSA. Results are given in Table 4. 



TABLE 4 



50 



Clone C e 1 1 Number (x lOz Z ) EL ISA n g/m 1 pq/cell/dag 



55 



60 



962-1 


1 . 1 


2500 


2.20 


-2 


0.8 


1250 


1.56 


-3 


1.2 


1350 


1.12 


-4 


1.2 


550 


0.46 


-5 


1.2 


1550 


1.30 


-6 


1.2 


950 


0.80 
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The clone BHK/962-I was grown in larger scale culture, and several hundred micrograms of protein C were 
purified by affinity chromatography on a column prepared by coupling 7 mg of polyclonal sheep antibody 
against human protein C to 2 grams of CNBr-activated Sepharose 4B (Pharmacia, lnc. t Piscataway. NJ) Cell 
culture medium was applied to the column, the column was washed with I00 ml TBS, and the protein C was 
eluted with TBS containing 3 M KSCN or with pH II.5 buffer. Western blot analysis demonstrated that the 5 
mutant protein C was approximately 950/o in the two-chain form, compared to about 20<Vb two-chain protein C 
obtained from BHK cells transfected wtth the native sequence. 

Milligram quantities of protein C were purified from either stable BHK cell clones expressing the PC962 
mutant protein or stable 293 cell clones expressing the wild-type protein C (p594 transfected cells) using a 
monoclonal antibody column specific for the calcium-induced conformation of protein C. Cell culture media 10 
were applied to the column in the presence of 5 mM CaC! 2 , the column was washed with TBS containing 5 mM 
CaCI 2l and the protein C was eluted with TBS containing 10 mM EDTA. The use of this purification method 
permitted purification of completely active protein C without exposure to denaturing conditions. The purified 
protein C was analyzed by SDS/PAGE followed by silver staining and was shown to be >95<Vo pure 

The BHK-produced PC962 protein was assayed for its ability to be activated to a form which shows both 15 
amidolytic and anticoagulant activities. Affinity-purified protein samples were exhaustively dialyzed against 
TBS, then activated by incubation at 37° C for I hour with O.I volume of I unit/ml Protac C (American 
Diagnostica). Amidolytic activity was measured by adding aiiquots of the activation mixture to I00 ul of I mM 
protein C substrate (Spectrozyme PCa, American Diagnostica) in a microtiter well and measuring the change 
in A405 over time using a microtiter plate reader. Anticoagulant activity of the activated protein C was assayed 
as descnbed by Sugo etai. (ibid.). The affinity-purified pC962 protein was demonstrated to be fully active in 
both amidolytic and anticoagulant assays. Elution from the antibody column with pH II.5 buffer was shown to 
yield a protein with higher activity than that obtained using 3 M KSCN elution. 

Clonal cell lines from the pDX/PC962 transaction into BHK cells were also isolated by a process of limiting 
dilution. One plate of MTX-selected colonies (approximately 300 colonies) was trypsinized, counted and 
re-plated into microtiter wells at an average of 0.5 cell/well. These were grown up in selective media containing 
250 nM MTX. About 500/ 0 of the welis contained colonies. Wells containing identifiable colonies (I-2 mm 
diameter) were assayed by ELISA for protein C level in the media. For this assay, fresh medium was added to 
all the wells, allowed to incubate for 75 minutes, then removed and assayed. Five colonies which gave 
75-minute accumulations of greaterthan 50 ng/ml (corresponding to over 1000 ng/ml/day) were split into lOcm 30 
plates for larger scale culture. Protein C production levels for these clones ranged from LI to 2.8 pg/cell/day 

A second plasmid, designated PC962/229, was constructed by inserting the PC962 cDNA into plasmid 
Zem229. Zem229 is a pUC!8-based expression vector containing a unique Bam HI site for insertion of foreign 
DNA between the mouse metallothionein-l promoter and SV40 transcription terminator. 2em229 aJso contains 
an expression unit of the SV40 early promoter, mouse dihydrofolate reductase gene, and SV40 terminator An 35 
Eco Rl fragment containing the pC962 cDNA from pDX/PC962 was iigated, with Eco Rl-Bam Hi synthetic 
oligonucleotide adaptors, to Zem229, which had been cut with Bam HI and treated with phosphatase The 
resulting vector is PC962/229, illustrated in Figure 14. 

Plasmid PC962/229 was transfected into BHK cells by the calcium phosphate method. Cells were cultured in 
Dulbecco's MEM containing I0o/ 0 fetal calf serum and 5 ug/ml vitamin K. The 48-hour transient expression level 40 
from this transfection was approximately 25 ng/ml. After 2 days, the transfected cells were split into selective 
media containing 250 nM MTX and cultured for an additional I4 days. Three plates from this transfection 
containing approximately 200 colonies each, were screened by the immunofilter assay, and the 24 most 
intensely reacting colonies were picked by cylinder cloning. These were grown individually in 10-cm plates and 
their protein C production levels were measured. Colonies producing between I.I and 2.3 pg/cell/day were 
used for the production of stable protein C-producing cell lines. 

Expression vector pDX/PC962 and plasmid pKO-neo were co-transfected by the calcium phosphate 
method into 293 ceils. Transfected cells were split into media containing 500 ug/ml G4I8 after 48 hours. After 10 
days in selective media, immunofilter assays were done and two clones were picked by cylinder cloning 
Protein C production was found to range from I to 2 pg/cell/day. The cultures were scaled up. and protein C 
was purified by immuno-affinity chromatography. Greater than 95<Vb of the protein C was found to be in the 
two-chain form. 

The structure of the 962 mutant protein prepared from BHK and 293 cells was compared to that of wild-type 
protein C from 293 cells and from plasma. Analysis by SDS/PAGE followed by silver staining showed that all 
the recombinant proteins contained heavy and light chains which co-migrated with those of the plasma 55 
protein. The wild-type protein C synthesized in 293 cells contained a significant amount (approximately 200/0) 
of single-chain, unprocessed protein of Mr-66,000, whereas the mutant protein produced in either cell type 
was essentially completely processed to two chains. N-terminal sequence analysis showed that both the light 
and heavy chains of the recombinant wild-type and BHK/PC962 mutant proteins were prpperly processed. The 
extent of gamma carboxylation of the recombinant proteins was measured by two distinct ELISA systems The 60 
first system recognizes both gamma-carboxyiated and non-carboxyiated forms of the protein, while the 
second utilizes specific antibodies which only recognize protein C which has undergone a gla-induced 
conformational change in the presence of calcium. Analysis indicated that approximately 6OO/0 of the 
recombinant protein C produced in BHK cells and 9fJOA>-95o/o of that produced in 293 cells was sufficiently 
gamma carboxylated to be recognized by the specific antibodies. 55 
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The three recombinant proteins were also analyzed for amidolytic and anticoagulant activity and the results 
were compared to the activity of plasma protein C. PC962 from BHK cells and wild-type protein C from 293 
cells both showed full amidolytic activity. In the anticoagulant assay, protein C from BHK cells had essentially 
the same specific activity as plasma protein C, whereas both wild-type and PC962 mutant proteins from 293 
5 cells consistently exhibited approximately 40% greater specific activity. 

C. Modification of Activated Protein C Processing Site. 

A DNA sequence encoding an activated protein C precursor with the processing site sequence 
Arg-Arg-Lys-Arg was constructed by mutagenesis of the wild-type protein C sequence. The resultant 

10 sequence was analogous to that in pPC829, but lacked the portion ecoding the activation peptide. 

The protein C sequence present in plasmid p594 was altered in a single mutagenesis to delete the codons 
for the activation peptide and insert the Arg-Arg codons at the processing site. Mutagenesis was performed 
on the 870 bp Sst I fragment from p594 essentially as described in Example 7A using an oligonucleotide having 
the sequence 5' CGC AGT CAC CTG AGA AGA AAA CGA CTC ATT GAT GGG 3'. 

15 The mutagenized sequence was used to construct expression vector pDX/PCI058 and the vector was 
co-transfected into BHK cells as described in Example 7B. The protein was purified on a polyclonal antibody 
column eluted with pH IL5 buffer. 

The activity of the I058 protein was compared to that of activated plasma protein C and activated PC962. 
Plasma protein C and PC962 (5 jig/ml) were activated by treatment with l/IO volume Protac C (American 

20 Diagnostica) for two hours. Anticoagulant activity was assayed by combining 50 uJ human plasma with 50 \xi 
activated protein C and incubating the mixtures at 38° C for 150 seconds. To this mixture was added 50 pJ 
activated cephaloplastin (American Scientific Products, McGaw Park, IL) and the mixture was incubated at 
37° C for 300 seconds. One hundred p.1 of 20 mM CaCl2 was added and the clotting time was recorded. Data 
are presented in Figure 15. 

25 

Example 8 

Use of the Factor VII and Prothrombin Pre-Pro Peptides to Secrete Protein C 
The factor VII pre-pro peptide was substituted for the protein C pre-pro peptide in an effort to obtain higher 
30 yields of properly processed protein C. These hybrid constructs are then inserted into suitable expression 

vectors and transfected into mammalian cell lines. 

A cDNA encoding factor Vll has been described (Hagen et aI M Proc. Natl. Acad. Sci. USA 83: 2412-2416, 1986). 

Clone HV1I565 comprises the coding sequence for a 38 amino acid pre-pro peptide. This coding sequence was 

isolated as an Eco Rl-Hha I fragment of 140 bp. 
35 The protein C sequence was isolated from p594 by partial cleavage with Sst I and complete digestion with 

Eco Rl. A 1540 bp fragment extending from the Sst I site at codon +7 to the Eco Rl site 3 r to the cDNA was 

isolated. 

The factor Vll and protein C sequences were then joined by means of an oligonucleotide linker which 
completes the coding sequence for amino acids -3 to -I of the factor vli pre-pro peptide and amino acids 1-8 of 

40 protein C. The linker was constructed from two oligonucleotides having the sequences 5'CCGGCGCGCCAA- 
Ci'CCTTCCTGGAGGAGCT3' and 5'CCTCCAGGAAGGAGTTGGCGCGCCGGCG3'. The two oligonucleotides 
were annealed and joined, in a four-part ligation, to the factor Vll pre-pro sequence, protein C cDNA and pUC9 
which had been cleaved with Eco Rl and treated with bacterial alkaiine phosphatase. The ligated DNA was 
used to transform E. coll (JM 101). Plasmid DNA was prepared and screened for the presence of a 1710 bp Eco Rl 

45 fragment. A correct clone was designated p7/C-IO. 

The factor VU/protein C fusion was expressed in 293 cells. The Eco Rl insert from plasmid p7/C-!0 was 
ligated to Eco Rl-digested pDX. The resulting expression vector was used to co-transfect 293 cells as 
previously described. Forty-eight hour expression levels were assayed by ELISA and compared to those of 
293 cells transfected with the wild-type protein C expression construct and untransfected cells. Results are 

50 presented in Table 5. 

TABLE 5 



55 Protein ng/ml 

Factor VU/protein C 123 
60 Wild-type protein C 187 

Control <1 

The prothrombin leader sequence was constructed from the oligonucleotides listed in Table 6 and fused to 
65 the mature protein C coding sequence. The oligonucleotides were kinased by combining 50 ng of each 
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teS« ri °1^7^ ' ° f T t kinaS6 iR 2 ° ° f kinase buffer containing I mM ATP. The reaction was allowed 
to proceed at 37 C for 30 m.nutes. then the mixture was heated to 65°C for 10 minutes to inactivate the kinase 

TABLE 6 

ZC 1323 5' CCT CCA GGA AGG AGT TGG CTC GCC GGA 3' 

ZC 1324 5' CGC GTC CGG CGA GCC AAC TCC TTC CTG GAG GAG 
CT 3' 



10 



15 



ZC 1378 5 ' AAT TCC ACC ATG GCT CAT GTG AG A GGA CTG CAA 

CTG CCT GGC TGC CTG GCT CTG GCT GCT CTG TGC AGC 
CTG GTG CAC AGC CAG CAT GTG TTC CTG GCT CCT CAG 20 
CAG GCC AGG AGC CTG CTG CAA 3' 

ZC 1379 5' CGC GTT GCA GCA GGC TCC TGG CCT GCT GAG GAG 25 
CCA GGA ACA CAT GCT GGC TGT GCA CCA GGC TGC ACA 
GAG CAG CCA GAG CCA GGC AGC CAG GCA GTT GCA GTC 
CTC TCA CAT GAG CCA TGG TGG 3' 30 



, « W f * 8n assemb led. Fifty ng of Eco Rl, Sst l-cut MI3mpl9 was combined with 2 5 ng 
each of the kinased oligonucleotides in 20 ul of Ix ligase buffer containing I mM ATP and 4 units of T 4 Hgase The 
m^' n Th ed a nL C f ° r 48 h ° UrS Hnd transf ormed into competent E. coll JMIO. cel.s. A c.ea* plaque 
c^stnlcSd phage DN A was prepared. DNA sequencing confirmed thatthTcorrect sequence had been 

The prothrombin leader was then joined to the protein C sequence. RF DNA was prepared from the phaoe 

d t ,nm9 Synt * eS i 2ed ' eader and 8 150 bp EC ° RI " Sst 1 fra 9 ment was delated. Plasmid X was 

d.gested to comptet.on wth Eco Rl and partially digested with Sst I and the 1540 bp protein C fragmerrt was 

™t re n t T r eSe l fra u 9 ^n, ntS r™ " 9ated With Ec ° R '- cut P° X and the «5 ati on mixture was usee ItoSSlSS 
™S^n JMr 5 "f 01 °f S - PlaSmid DNA was isolated from transfcrmant colonies and analyzed \Z 45 
restr.ct.on d gesfton to confirm that the fragments had been assembled in the correct orientation 

h«X.h I f/* 9 ?" 9 " be f P, reciate o t^t. although specific embodiments of the invention have been 
described herein for purposes of illustration, various modifications may be made without deviating from the 
sp.rrt and scope of the invention. According*, the invention is not to be limited except as by the appended 

The features disclosed in the foregoing description, in the following claims and/or in the accomoanvino 
b0th W and in any combination thereof, be material for realising the £ESS5!352 



Claims 



''t»,nT whjch codes for a protein having substantially the same biological activity as human 

Kr 2 R^Tp ^lo^ S3id sequence codin 9 f ° r »eln«no acidsequence 

EgiSZZXS!! 1 - ^ ™ ^ ° r ArS - "-».«. 2 or 3. at the cleavage site between 

*2^l£S^^£^ S3id amin ° SeqU8nCe iS ^-LYS-Arg. at the cleavage 
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3. A DNA sequence which codes for a protein having substantially the same biological activity as human 
protein C or human activated protein C, said sequence further coding for the pre-pro peptide of a protein 
selected from the group consisting of factor VII, factor IX, factor X, prothrombin and protein S. 

4. A DNA sequence which codes for a protein having substantially the same biological activity as human 
protein C or human activated protein C, said protein further including the substitution of residue I58 with a 
non-acidic amino acid residue selected from the group consisting of Ala, Ser, Thr, and Gly. 

5. A DNA sequence which codes for a protein having substantially the same biological activity as human 
protein C or human activated protein C, said protein further including the substitution of residue I54 with a 
basic amino acid residue selected from the group consisting of Lys or Arg. 

6. A DNA sequence which codes for a protein having substantially the same biological activity as human 
protein C or human activated protein C, said protein further including the substitution of Lys-Arg at 
residues I56-I57 with Lys-Lys or Arg-Arg. 

7. An expression vector capable of integration in mammalian host cell DNA, said expression vector 
including a promoter followed downstream by a DNA sequence according to any of claims I-6, said DNA 
sequence being followed downstream by a polyadenylation signal, wherein transcription of the DNA 
sequence is directed by the promoter. 

8. Mammalian cells transfected with an expression vector according to claim 7. 

9. The cells of claim 8 wherein said cells are selected from the group consisting of COS, BHK, Rat Hep I, 
Rat Hep II, TCMK, Human lung. Human hepatoma, Hep G2, Mouse liver, DUKX and 293 cells. 

10. A method for producing a protein which, upon activation, has substantially the same structure and/or 
biological activity as human activated protein C, comprising: 

introducing into a mammalian host cell an expression vector according to claim 7; 
growing said mammalian host cell in an appropriate medium; and 

isolating the protein product encoded by said expression vector and produced by said mammalian 
host ceil. 

11. The method of claim JO wherein said cell is selected from the group consisting of COS, BHK, Rat Hep I. 
Rat Hep II, TCMK, Human lung, Human hepatoma, Hep G2, Mouse liver, DUKX and 293 cells. 

12. A method for producing a protein which, upon activation, has substantially the same structure and/or 
biological activity as human activated protein C, comprising: 

introducing into a BHK host cell an expression vector capable of integration into BHK host cell DNA r 
said expression vector including a promoter followed downstream by a DNA sequence according to any 
of claims I, 5 or 6, said DNA sequence being followed downstream by a polyadenylation signal, wherein 
transcription of the DNA sequence is directed by the promoter; 

growing said BHK host cell in an appropriate medium; and 

isolating the protein product encoded by said DNA sequence and produced by said BHK host cell. 

13. The method of claims I0-I2, further comprising introducing into the host cell, with said expression 
vector, a selectable marker. 

14. The method of claims 10-12, further comprising the step of activating the protein product to produce a 
protein having substantially the same biological activity as human activated protein C. 

15. The method of claim 14 wherein the activating step comprises cleavage of the protein product with a 
protease selected from the group consisting of alpha-thrombin, trypsin, and venom activators. 
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GGCTGTCATG GCGGCAGG AC GGCGAACTTG CAGTATCTCC ACGACCCGCC CCTGT 
-42 -40 -30 

ATG TGG CAG CTC ACA AGC CTC CTG CTG TTC GTG GCC ACC TGG GGA ATT TCC GGC 

MET Trp Gin Leu Thr Sep Leu Leu Leu Phe Val Ala Thr Trp Gly lie Ser Gly 

-20 -10 

ACA CCA GCT CCT CTT GAC TCA GTG TTC TCC AGC AGC GAG CGT GCC CAC CAG GTG 

Thr Pro Ala Pro Leu Asp Ser Val Phe Ser Ser Ser Glu Arg Ala His Gin Val 

-1 +1 10 

CTG CGG ATC CGC AAA CGT GCC AAC TCC TTC CTG- GAG GAG CTC CGT CAC AGC AGC 

Leu Arg lie Arg Lys Arg Ala Asn Ser Phe Leu Glu Glu Leu Arg His Ser Ser 

20 30 

CTG GAG CGG GAG TGC ATA GAG GAG ATC TGT GAC TTC GAG GAG GCC AAG GAA ATT 

Leu Glu Arg Glu Cys lie Glu Glu lie Cys Asp Phe Glu Glu Ala Lys Glu lie 

no 

TTC CAA A AT GTG GAT GAC ACA CTG GCC TTC TGG TCC AAG CAC GTC GAC GGT GAC 

Phe Gin Asn Val Asp Asp Thr Leu Ala Phe Trp Ser" Lys His Val Asp Gly Asp 

50 60 

CAG TGC TTG GTC TTG CCC TTG GAG CAC CCG TGC GCC AGC CTG TGC TGC GGG CAC 

Gin Cys Leu Val Leu Pro Leu Glu His Pro Cys Ala Ser Leu Cys Cys Gly His 

70 80 

GGC ACG TGC ATC GAC GGC ATC GGC AGC TTC AGC TGC GAC TGC CGC AGC GGC TGG 

Gly Thr Cys lie Asp Gly lie Gly Ser Phe Ser Cys Asp Cys Arg Ser Gly Trp 

90 + 100 

GAG GGC CGC TTC TGC CAG CGC GAG GTG AGC TTC CTC AAT TGC TCG CTG GAC AAC 

Glu Gly Arg Phe Cys Gin Arg Glu Val Ser Phe Leu Asn Cys Ser Leu Asp Asn 

no 120 

GGC GGC TGC ACG CAT TAC TGC CTA GAG GAG GTG GGC TGG CGG CGC TGT AGC TGT 

Gly Gly Cys Thr His Tyr Cys Leu Glu Glu Val Gly Trp Arg Arg Cys Ser Cys 

130 

GCG CCT GGC TAC AAG CTG GGG GAC GAC CTC CTG CAG TGT CAC CCC GCA GTG AAG 
Ala Pro Gly Tyr Lys Leu Gly Asp Asp Leu Leu Gin Cys His Pro Ala Val Lys 

140 150 ^ 

TTC CCT TGT GGG AGG CjCC TGG AAG CGG ATG GAG AAG AAG CGC AGT CAC CTG AAA 

Phe Pro Cys Gly Arg Pro Trp Lys Arg Met Glu Lys Lys Arg Ser His Leu Lys 
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A GAC ACA GAA GAC CAA GAA GAC CAA GTA GAT CCG CGG^C 



J70 

CGA'GAC ACA GAA GAC CAA GAA GAC CAA GTA GAT CCG CGG^CTC ATT GAT GGG AAG 
Arg Asp Thr Glu Asp Gin Glu Asp Gin Val Asp Pro Arg Leu lie Asp Gly Lys 
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180 190 
ATG ACC AGG CGG GGA GAC AGC CCC TGG CAG GTG GTC CTG CTG G AC TCA AAG AAG 
Met Thr Arg Arg Gly Asp Ser Pro Trp Gin Val Val Leu Leu Asp Ser Lys Lys 

200 210 
AAG CTG GCC TGC GGG GCA GTG CTC ATC CAC CCC TCC TGG GTG CTG ACA GCG GCC 
Lys Leu Ala Cys Gly Ala Val Leu He His Pro Ser Trp Val Leu Thr Ala Ala 

220 

CAC TGC ATG GAT GAG TCC AAG AAG CTC CTT GTC AGG CTT GGA GAG TAT GAC CTG 
His Cys Met Asp Glu Ser Lys Lys Leu Leu Val Arg Leu Gly Glu Tyr Asp Leu 

230 240 
CGG CGC TGG GAG AAG TGG GAG CTG GAC CTG GAC ATC AAG GAG GTC TTC GTC CAC 
Arg Arg Trp Glu Lys Trp Glu Leu Asp Leu Asp lie Lys Glu Val Phe Val His 

♦ 250 260 

CCC AAC TAC AGC AAG AGC ACC ACC GAC AAT GAC ATC GCA CTG CTG CAC CTG GCC 
Pro Asn Tyr Ser Lys Ser Thr Thr Asp Asn Asp He Ala Leu Leu His Leu Ala 

270 280 
CAG CCC GCC ACC CTC TCG CAG ACC ATA GTG CCC ATC *TGC CTC CCG GAC AGC GGC 
Gin Pro Ala Thr Leu Ser Gin Thr He Val Pro He Cys Leu Pro Asp Ser Gly 

290 300 
CTT GCA GAG CGC GAG CTC AAT CAG GCC GGC CAG GAG ACC CTC GTG ACG GGC TGG 
Leu Ala Glu Arg Glu Leu Asn Gin Ala Gly Gin Glu Thr Leu Val Thr Gly Trp 

310 4 

GGC TAC CAC AGC AGC CGA GAG AAG GAG GCC AAG AGA AAC CGC ACC TTC GTC CTC 
Gly Tyr His Ser Ser Arg Glu Lys Glu Ala Lys Arg Asn Arg Thr Phe Val Leu 

320 ♦ 330 

AAC TTC ATC AAG ATT CCC GTG GTC CCG CAC AAT GAG TGC AGC GAG GTC ATG AGC 
Asn Phe He Lys He Pro Val Val Pro His Asn Glu Cys Ser Glu Val Met Ser 

3^0 350 
AAC ATG GTG TCT GAG AAC ATG CTG TGT GCG GGC ATC CTC GGG GAC CGG CAG GAT 
Asn Met Val Ser Glu Asn Met Leu Cys Ala Gly He Leu Gly Asp Arg Gin Asp 

360 370 
GCC TGC GAG GGC GAC AGT GGG GGG CCC ATG GTC GCC TCC TTC CAC GGC ACC TGG 
Ala Cys Glu Gly Asp Ser Gly Gly Pro Met Val Ala Ser Phe His Gly Thr Trp 

380 390 
TTC CTG GTG GGC CTG GTG AGC TGG GGT GAG GGC TGT GGG CTC CTT CAC AAC TAC 
Phe Leu Val Gly Leu Val Ser Trp Gly Glu Gly Cys Gly Leu Leu His Asn Tyr 

400 

GGC GTT TAC ACC AAA GTC AGC CGC TAC CTC GAC TGG ATC CAT GGG CAC ATC AGA 
Gly Val Tyr Thr Lys Val Ser Arg Tyr Leu Asp Trp He His Gly His He Arg 
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GAC AAG GAA GCC CCC CAG AAG AGC TGG GCA CCT TAG CGACCCTCCC TGCAGGGCTG 
Asp Lys Glu Ala Pro Gin Lys Ser Trp Ala Pro 

GGCTTTTGCA TGGCAATGGA TGGGACATTA AAGGGACATG TAACAAGCAC ACCGGCCTGC TGTTCTGTCC 

TTCCATCCCT CTTTTGGGCT CTTCTGGAGG GAAGTAAC AT TTACTGAGCA CCTGTTGTAT GTC AC ATGCC 

TTATGAAT AG AATCTTAACT CCTAGAGCAA CTCTGTGGGG TGGGGAGGAG CAGATCCAAG TTTTGCGGGG 

TCTAAAGCTG TGTGTGTTG A GGGGGAT ACT CTGTTTATGA AAAAGAATAA AAAACACAAC CACG AAAAAA 
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